PrOntoLearn: Unsupervised Lexico-Semantic Ontology Generation using Probabilistic Methods

نویسندگان

  • Saminda Abeyruwan
  • Ubbo Visser
  • Vance P. Lemmon
  • Stephan C. Schürer
چکیده

Formalizing an ontology for a domain manually is well-known as a tedious and cumbersome process. It is constrained by the knowledge acquisition bottleneck. Therefore, researchers developed algorithms and systems that can help to automatize the process. Among them are systems that include text corpora for the acquisition. Our idea is also based on vast amount of text corpora. Here, we provide a novel unsupervised bottom-up ontology generation method. It is based on lexico-semantic structures and Bayesian reasoning to expedite the ontology generation process. We provide a quantitative and two qualitative results illustrating our approach using a high throughput screening assay corpus and two custom text corpora. This process could also provide evidence for domain experts to build ontologies based on top-down approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Unsupervised Approach for Semantic Relation Interpretation

In this work we propose a hybrid unsupervised approach for semantic relation extraction from Italian and English texts. The system takes as input pairs of “distributionally similar” terms, possibly involved in a semantic relation. To validate and label the anonymous relations holding between the terms in input, the candidate pairs of terms are looked for on the Web in the context of reliable le...

متن کامل

A lexico-semantic pattern language for learning ontology instances from text

The Semantic Web aims to extend the World Wide Web with a layer of semantic information, so that it is understandable not only by humans, but also by computers. At its core, the Semantic Web consists of ontologies that describe the meaning of concepts in a certain domain or across domains. The domain ontologies are mostly created and maintained by domain experts using manual, time-intensive pro...

متن کامل

An Intelligent Approach for Constructing Domain Ontology Using Art2 Neural Network and C-Value Method

Research on semantic webs has become increasingly widespread in the computer science community. The core technology of a semantic web is an artefact called an ontology. The major problem in constructing an ontology is the long period of time required. Another problem is the large number of possible meanings for the knowledge in the ontology. To overcome these problems, one approach is developin...

متن کامل

SEMILAR: A Semantic Similarity Toolkit for Assessing Students' Natural Language Inputs

We present in this demo SEMILAR, a SEMantic similarity toolkit. SEMILAR includes offers in one software environment several broad categories of semantic similarity methods: vectorial methods including Latent Semantic Analysis, probabilistic methods such as Latent Dirichlet Allocation, greedy lexical matching methods, optimal lexico-syntactic matching methods based on word-to-word similarities a...

متن کامل

Ontology Enrichment for the Food Traceability Domain Using Romanian Lexico-syntactic Patterns

Ontologies are considered as the most important building blocks of semantic Web. Building such ontologies is a time consuming and difficult task, which requires a high degree of human intervention. In this paper we describe a method to facilitate the enrichment of Romanian language domain taxonomies by using a text-mining approach. We exploit Romanian domain specific texts in order to automatic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010